Eecient Search of Reliable Exceptions
نویسندگان
چکیده
Finding patterns from data sets is a fundamental task of data mining. If we categorize all patterns into strong, weak, and random, conventional data mining techniques are designed only to nd strong patterns, which hold for numerous objects and are usually consistent with the expectations of experts. While such strong patterns are helpful in prediction, the unexpectedness and contradiction exhibited by weak patterns are also very useful although they represent a relatively small number of objects. In this paper, we address the problem of nding weak patterns (i.e., reliable exceptions) from databases. A simple and eecient approach is proposed which uses deviation analysis to identify interesting exceptions and explore reliable ones. Besides, it is exible in handling both subjective and objective exceptions. We demonstrate the eeectiveness of the proposed approach through a set of real-life data sets, and present interesting ndings.
منابع مشابه
Discovering Compressive Partial Determinations in Mixed Numerical and Symbolic Domains
Partial determinations are an interesting form of dependency between attributes in a relation. They generalize functional dependencies by allowing exceptions. We modify a known MDL formula for evaluating such partial determinations to allow for its use in an admissible heuristic in exhaustive search. Furthermore we describe an eecient preprocessing-based approach for handling numerical attribut...
متن کاملLocal Evolutionary Search Enhancement by Random Memorizing
| For the calibration of laser induced plasma spectrometers robust and eecient local search methods are required. Therefore, several local optimizers from nonlinear optimization, random search and evolutionary computation are compared. It is shown that evolutionary algorithms are superior with respect to reliability and eeciency. To enhance the local search of an evolutionary algorithm a new me...
متن کاملOn the Benefits of Random Memorizing in Local Evolutionary Search
For the calibration of laser induced plasma spectrometers robust and eecient local search methods are required. Therefore, several local optimizers from nonlinear optimization, random search and evolutionary computation are compared. It is shown that evolutionary algorithms are superior with respect to reliability and eeciency. To enhance the local search of an evolutionary algorithm a new meth...
متن کاملEecient Learning of Regular Languages Using Teacher-supplied Positive Samples and Learner-generated Queries 1 2
We present a new algorithm for eecient learning of regular languages from examples and queries. A reliable teacher who knows the unknown regular grammar G (or is able to determine if certain strings are accepted by the grammar) will guide the learner in achieving the goal of inferring an equivalent grammar G. The teacher provides the learner with a structurally complete set of positive examples...
متن کاملExtending Mehrotra's Corrector for Linear Programs
In this article a primal-dual interior-point method for solving linear programs is proposed. A new approach for generating higher-order search directions, and a new method for an eecient higher-order subspace search along several search directions are the basis of the proposed extension. The subspace search is reduced to a linear program in several variables. The method using the simplest (two-...
متن کامل